Quality Control for RNA-Seq (QuaCRS): An Integrated Quality Control Pipeline
Abstract
QuaCRS (Quality Control for RNA-Seq) is an integrated, simplified quality control (QC) system for RNA-seq data. It allows easy execution of several open-source QC tools, aggregates their output, and helps users quickly identify quality issues through meta-analyses of QC metrics across large numbers of samples in different studies. It comprises two main components. The first is the QC Pack wrapper, which executes three QC tools: FastQC, RNA-SeQC, and selected functions from RSeQC. Combining these three tools into one wrapper makes them easier to use and gives a much more complete view of sample data quality than any individual tool. The second is the QC database, which displays the resulting metrics in a user-friendly web interface. It was designed to allow users with less computational experience to easily generate and view QC information for their data, to investigate individual samples and aggregate reports of sample groups, and to sort and search samples based on quality. The structure of the QuaCRS database is designed to allow additional tools and metrics to be added in the future. The source code for not-for-profit use and a fully functional sample user interface with mock data are available at http://bioserv.mps.ohio-state.edu/QuaCRS/.
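The aggregation and cross-sample comparison described above can be illustrated with a minimal sketch. It is not the QuaCRS implementation: the function names (`merge_tool_metrics`, `flag_outliers`), the metric keys, the numeric values, and the median-based outlier rule are all hypothetical, chosen only to show how per-tool metrics might be flattened into one record per sample and then compared across a cohort.

```python
from statistics import median


def merge_tool_metrics(per_tool):
    """Flatten {tool: {metric: value}} into {"tool.metric": value}.

    The tool and metric names are illustrative assumptions, not the
    actual fields stored by QuaCRS.
    """
    merged = {}
    for tool, metrics in per_tool.items():
        for name, value in metrics.items():
            merged[f"{tool}.{name}"] = value
    return merged


def flag_outliers(samples, metric, max_ratio=0.5):
    """Return sorted sample names whose metric deviates from the cohort
    median by more than max_ratio (relative deviation).

    This simple rule is an assumption made for illustration only.
    """
    values = [m[metric] for m in samples.values() if metric in m]
    if not values:
        return []
    center = median(values)
    if center == 0:
        return []
    return sorted(
        name
        for name, m in samples.items()
        if metric in m and abs(m[metric] - center) / abs(center) > max_ratio
    )


# Hypothetical per-sample output from the three wrapped tools.
raw = {
    "S1": {"FastQC": {"mean_quality": 36.1}, "RNA-SeQC": {"mapping_rate": 0.92}},
    "S2": {"FastQC": {"mean_quality": 35.8}, "RNA-SeQC": {"mapping_rate": 0.90}},
    "S3": {"FastQC": {"mean_quality": 35.9}, "RNA-SeQC": {"mapping_rate": 0.35}},
    "S4": {"FastQC": {"mean_quality": 36.0}, "RNA-SeQC": {"mapping_rate": 0.91}},
}

samples = {name: merge_tool_metrics(tools) for name, tools in raw.items()}
low_quality = flag_outliers(samples, "RNA-SeQC.mapping_rate")
print(low_quality)
```

Keeping one flat record per sample is one way to make sorting and searching by any metric straightforward, which is the kind of cross-sample view the QC database is described as providing.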